Classical simulation of limited-width cluster-state quantum computation 
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We present a classical protocol, using the matrix product state representation, to simulate cluster- 
state quantum computation at a cost polynomial in the number of qubits in the cluster and expo- 
nential in d - the width of the cluster. We use this result to show that any log-depth quantum 
computation in the gate array model, with gates linking only nearby qubits, can be simulated 
efficiently on a classical computer. 
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The model of cluster-state quantum computation pro- 
posed by Raussendorf and Briegel |l| has attracted much 
interest in recent years. This model demonstrates some 
remarkable features, most notably the fact that once a 
particular multi-qubit state (the cluster-state) has been 
prepared, the whole processing of the data is imple- 
mented by measurements of individual qubits and feed- 
forward (that is, the measurement carried out on one 
qubit may depend on the outcomes of previous measure- 
ments). In general the cluster-state is thought of as a 
two-dimensional grid of qubits, entangled by applying 
CPHASE operations between neighbouring pairs. Due to 
this entangled structure, any quantum circuit can be sim- 
ulated by cluster state computation Q] with a polynomial 
resource overhead (in terms of the number of qubits and 
elementary operations). Yet, the computational power 
of a cluster does not depend only on the amount of re- 
sources that were invested in its construction, but also on 
its geometry. It was shown by Nielsen that any compu- 
tation implemented with a linear cluster (a single chain 
of qubits) can be simulated efficiently on a classical com- 
puter Q. 

In this paper we analyse the computational power of 
limited- width cluster-states. We show that any computa- 
tion on a cluster-state in the shape of a rectangular grid 
can be simulated on a classical computer at a cost which 
is quadratic in the number of qubits and polynomial in 
2 d , where d is the width of the cluster. Therefore, if we 
limit the width so it scales like the log of the number 
of logical qubits, then any computation on that cluster 
can be simulated efficiently. We will generalize our proof 
to include also clusters in which the connections between 
qubits are not necessarily between nearest neighbours, 
but are bounded by some constant length. 

Since any quantum computation can be implemented 
by cluster-state computation our results imply that any 
quantum computation in the gate array model with depth 
that scales like log of the number of qubits, and where the 
range of the interactions is bounded by a constant (that is 
two-qubit gates are applied only to qubits which are not 
too far apart) can be efficiently simulated on a classical 
computer. Similar results concerning the computational 
power of limited-depth quantum gate arrays have been 
reported recently 0] following from a completely different 
approach. 



Our approach relies on the representation of the 
cluster-state as a matrix product state (MPS). We shall 
follow here the construction of Vidal |4( through which 
the matrix representation of the state of n qubits (\^)) 
is obtained by a sequence of n — 1 Schmidt decompo- 
sitions. These Schmidt decompositions relate to a cer- 
tain ordering of the qubits, where the k th decomposition 
corresponds to a partition of the system into the first k 
qubits and the remaining n — k qubits. Let us first write 
the state of the system in the computational basis 
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The key point of the construction is the representation 
of the coefficients C^...^ as a product of n tensors (F^l) 
and n — 1 vectors (A^) 



C, 



E 



r [l]u A [l] r [2]i 2 \[2]-p[3]i 3 



•rktlv (2) 



Each index goes from 1 to the number of terms in 
the k th Schmidt decomposition (the Schmidt number), 
and the elements of the vector are the correspond- 
ing Schmidt coefficients. Note that the Schmidt number 
(or the log of this number) relating to a partition of the 
system can be seen as a measure of the entanglement be- 
tween the two parts 0] , and as such cannot increase under 
local operations and classical communication. We denote 
the maximal Schmidt number over all n — 1 Schmidt de- 
compositions by x- 

Let us now discuss in brief Vidal's construction pro- 
cess. One starts by expressing \^) using the first Schmidt 
decomposition (corresponding to the partition into qubit 
1 and the rest). 
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Expressing the Schmidt vector I^L 1 ]) in the computa- 
tional basis we obtain 
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In the next step the states |<J>aj ""') are expressed in terms 
of the computational basis states of the second qubit and 
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of the Schmidt vectors |$L 2 ) corresponding to the sec- 
ond Schmidt decomposition, to obtain 

I*) = E^^rL^AiH!)^}!^ 3 :'" 1 )- (5) 

This process can be repeated qubit by qubit until the rep- 
resentation in J2| is obtained. The crucial point here is 
that one can always express the Schmidt vectors |$a fc _™') 
of qubits [k, . . . ,n], obtained by the (k — l) th Schmidt de- 
composition, in terms of the Schmidt vectors |<l>a t +1 "'), 
obtained by the k th Schmidt decomposition. Hence we 
can always write 

l*K> = E rLtia^Ilu)^^ 1 -" 1 )- (6) 

ikak 

It is fairly easy to see that if this was not the case and 
one of had a component outside the subspace 

spanned by the states |«fc)|$Q fc +1 then the overall 
state 1^) could not lie within the subspace spanned by 

I^L 1 ,"^)!*^ 1 "" 1 ), in contradiction to the k th Schmidt 
decomposition. 

A description of the state of our system in terms of the 
r^'s and A^'s would require approximately (2\ 2 + x) n 
parameters, instead of the 2™ coefficients (Cji-..i„) re- 
quired to represent the state in the computational basis. 
The parameter which determines the size of the descrip- 
tion of the system is therefore x- In general x is of order 
2™ and the MPS representation is not very useful. How- 
ever, if the state does not carry much entanglement then 
X may be smaller and the MPS representation may be 
advantageous. In particular if x scales like poly(n), than 
one would have an efficient description of the state in- 
volving only poly(n) parameters. 

Since cluster-state computation involves only single- 
qubit operations, it is easy to see that the Schmidt num- 
ber associated with any partition cannot increase, and 
hence that x "vvill not increase during the computation. 
Therefore, if the initial cluster-state has an efficient MPS 
representation all later states can also be represented ef- 
ficiently. Furthermore, we show that these later repre- 
sentations can be obtained efficiently, and hence prove 
the following: Any computation consisting of (projective) 
single-qubit measurements and feed-forward on a system 
of n qubits, where the maximal Schmidt number for all 
bipartitions of the system along a certain ordering is Xj 
can be classically simulated at a cost of 0(n 2 poly(x)) in 
computational time and memory space. 

In order to simulate a single-qubit measurement we 
need to calculate the probabilities for the two outcomes, 
sample from the probability distribution, project the 
state of the system accordingly and then renormalize the 
projected state. An arbitrary single-qubit measurement 
can be implemented by first applying a single-qubit uni- 
tary, and then measuring the qubit in the computational 



basis. Let us examine the effect of a single-qubit unitary 
acting on the fc th qubit on the MPS representation of 
the state. Clearly each of the computational basis states 
would undergo the following transformation 

|i*>— ►E c ^1 i *>- ( ? ) 

This operation can be incorporated into the tensors cor- 
responding to the k qubit - ra]a H1 - by replacing them 
with 

1 a k a k+1 ~ U 0tk L a k a k + i + u ^k L a k a k+1 • \° ) 

Updating the MPS representation after a single-qubit 
unitary would therefore take 0(x 2 ) basic operations. 

The probabilities of the outcomes in the computational 
basis can be easily calculated from the following repre- 
sentation of the state of the system, obtained by using 
the (k — l) th Schmidt decomposition, and equation @), 

E E ALt^Lt^.AWl^ir 11 )!^)!^'^)- 
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The probability p(ik) for receiving the outcome ik in a 
measurement of the k qubit is therefore obtained from the 
tensor r^^o^ and the vectors Xa*-} and Aq|. Defining 

Ah = \[k] (-ir\) 

and using the orthogonality of the Schmidt vectors, we 
have 

pfo)= E \ A t- ia f- en) 

"not 

Sampling from this probability distribution, and re- 
ceiving outcome |rfe), the state of the system after pro- 
jection and renormalisation will be |n £ )|'I''}, where 

|*'\ = 1 . \ " A rk |<j>[ 1 '-- fe -i]\|<T>[fc+i---™]\ 

VP( r k) a k _ iak 

(12) 

In what follows, we leave out the state of the measured 
qubits (which remain in a product state with the rest of 
the system), and consider the MPS representation of the 
remaining qubits. This representation must now be up- 
dated since the A's and T's above do not correspond to 
Schmidt decompositions of 1^'), and in order to be able 
to calculate the probability distribution for the measure- 
ment of the remaining qubits efficiently, the correct MPS 
representation must be recovered. Indeed as we consider 
a general n-qubit state (where the measurement of one 
qubit might affect all other qubits) all of the T's and A's 
must be updated. Tracing over qubits 1, . . . , k — 1 we 
obtain the x by X reduced density matrix p[ fc+1 ""1 

Pa k a' k ^ = E A a h -ia k ( Ar a k _ ia ' h )* ■ ( 13 ) 
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Given the reduced density matrix we can its find eigen- 
values Afl , which are the Schmidt coefficients for 

Pk — 1 

the above partition. We can also find its eigenvec- 
tors Mp k _ l0lk , which represent the new Schmidt vectors 

m ^ ne basis of the old Schmidt vectors: 



To calculate the Schmidt coefficients (A^j) for the next 
partition, between qubits [1, . . . , k — 1, k + 1] and [k + 
1, . . . , n] , we write the state of system as follows 

= E ^^aj*^ 11 )!*^) (15) 

^fc-i a fc+i 
•fc+i 



where 
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We can now write the reduced density matrix of qubits 
[k + 2, . . .n] in terms of the tensor B 

[k+2-n] = y B i k+1 , B i k+1 y 

0k— l^fe+i 

Having calculated pl fc + 2 "'™] we can now find its eigen- 
values A^' (the Schmidt coefficients) and its eigenvec- 
tors Mp liak+1 , which represent the new Schmidt vec- 
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in the basis of the old Schmidt vectors 
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The relation between l*^ 2 '"" 1 } and \^ l ^" ni ) defines 
corresponding to qubit k + 1. 
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the new tensors T 
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Examining 10 we see that 
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where we have used (|f 4|l and the corresponding relation 
for l^ 2 '"" 1 ) in the last line. 

We can proceed in the same manner to obtain all the 
reduced density matrices in one direction (p[ fc + 3 '"1 to 
p[™l) and their eigenvalues and eigenvectors. At each 
step the reduced density matrix is given as a function 
of the eigenvectors and eigenvalues obtained in the pre- 
vious step (as well as the old T's and A's). The new T's 



are found using the eigenvectors of two consecutive steps 
as in I jl 8[> . In order to update the MPS representation in 
the other direction we first express |$Jg ) in terms of 

using {T2Jl, {H} and 
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then re-express these states in terms of |$oi"a 2 ')l*fc-i) 
using 

l^a k -l I a k -2 L a k -2a k -l I ^a k -2 /l'fc-1/- 

(20) 

Expanding \ty') as in (|15H using (|19|l and l|2UI) . and tracing 
over qubits [k — 1, k + 1, . . . ,n] we obtain pi 1 -**- 2 !. The 
eigenvalues and eigenvectors of this matrix can be used to 
obtain 2 ' and f jg fc ^g* - * m a s i muar wa Y to before, 
and repeating this procedure we can obtain all of the 
remaining T's and A's. 

In each step of the updating procedure we deal with a 
constant number of (at most) x by X matrices, requiring 
poly(x) basic operations, and hence updating the whole 
state after a measurement requires n poly(x) basic oper- 
ations. As we can apply at most n single-qubit measure- 
ments to the state, the overall computation can be simu- 
lated with 0(n 2 poly(x)) computational resources. Note 
that we do not include the cost of computing the feed- 
forward from the measurement results, as this is common 
to both the quantum computation and the classical sim- 
ulation (and for standard cluster-state computation can 
be computed efficiently). 

In the above procedure we simulate the single-qubit 
measurements in the same order as they are measured in 
the actual computation. This is usually independent of 
the order in which qubits in the MPS representation are 
numbered, which we choose so as to minimize x- How- 
ever, if we can number the MPS representation in the 
same order as the qubits are measured without signifi- 
cantly increasing then the simulation can be consid- 
erably simplified. In this case we do not have to update 
the MPS representation after the measurements, as the 
probability distribution can be calculated directly from 
the projected state. The cost of the simulation in this 
case would be 0(nx 2 )- 

Let us now consider a cluster state in the shape of a 
rectangular grid of width d and length I > d where each 
qubit is entangled to all its nearest neighbors. In order 
to construct our MPS representation, we choose the fol- 
lowing ordering of the qubits: We start from the qubit 
in the top-left corner and number the qubits column by 
column until we reach the qubit in the bottom-right cor- 
ner. In this case, the maximal Schmidt number x f° r 
all bipartitions of the system is 2 d . To prove this, we 
consider a typical bipartition, and an incomplete cluster, 
similar to the original but where the 2d qubits next to 
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FIG. 1: A rectangular cluster state. The solid lines repre- 
sent entanglement (CPHASE connections), the thick dashed 
line shows a typical partition of the cluster, and the dotted 
lines show the connections that do not exit in the incomplete 
cluster. 



the partition are only entangled with each other (and not 
with the rest of the cluster), as shown in figure 1. The 
original cluster can be constructed from the incomplete 
cluster by local unitary operations (that is, operations 
which act on only one side of the partition) and there- 
fore both have the same Schmidt number S with respect 
to this partition. The state of the incomplete cluster can 
be written as | v E' j 4}|-F , }| v I'b), where \P) stands for the en- 
tangled state of the 2d qubits next to the partition and 
\^a/b) are the states of the remaining qubits on the left 
and right sides respectively. Since, \P) contains only 2d 
qubits, it is clear that the S < 2 d . In fact one can easily 
verify that S = 2 d , since \P) itself can be constructed 
from d maximally entangled pairs of qubits by local uni- 
tary operations. As no bipartition has a greater Schmidt 
number than S, we have % = 2 d . 

In general we expect both d and I to scale as poly(iV) 
where TV is the number of logical qubits, and hence the 
above simulation would require exponential resources. 
However, any cluster-state computation implemented by 
a grid of physical qubits with limited width, that is, 
where d scales like (at most) \og(N), can be simulated 
at a polynomial cost in time and memory space. 

We can also extend our approach to more general 
cluster-states, in which non-neighbouring qubits are con- 
nected by CPHASE operations, as long as these connec- 
tions have limited range. Consider a cluster where the 
vertical distance between connected qubits (across the 
width d) is not limited, and the horizontal distance is 
limited by r. That is, a qubit in column k may be con- 
nected to any qubit in columns {k — r, . . . , k+r}. Looking 
at a typical partition where the left side consists of the 



first k — 1 columns and part of the k column, and the 
right side contains the rest of the cluster, we consider an 
incomplete cluster where the block consisting of columns 
{k— r, . . . , k+r} is isolated from the rest of the cluster (i.e. 
in the incomplete cluster the qubits of this block have the 
same connections between themselves but no connections 
to the rest of the cluster). As in the previous case, the 
state of the incomplete cluster is given by |\E , a)|-P')I^'b) 
where \P') is the state of the isolated block. \P') con- 
sists of only (2r + l)d qubits, so the Schmidt number for 
the partition must be less than 2( r+1 / 2 ) d . As before, the 
original cluster can be recovered by local CPHASE oper- 
ations, hence \ 5= 2^ r+1 / 2 ^ d . It is therefore clear that any 
cluster-state computation on a rectangular grid can be 
efficiently simulated with a classical computer as long as 
either d (the width of the cluster) or r (the range of the 
connections) scales like \og(N), and the other is constant. 

In our general procedure we did not specify to which 
qubits the input is introduced and in which order the 
measurements are performed in the computation. Given 
an efficient MPS representation of the initial state of 
the cluster any computation can be simulated efficiently. 
Thus, considering the I x d rectangular grid above, we 
can allocate a column of physical qubits for each logical 
qubit (measuring the physical qubits, say, from top to 
bottom). Simulating a quantum gate array in this way, I 
would be proportional the number of logical qubits N, d 
would be proportional to the depth of the computation 
(i.e. the number of time-steps), and r would be propor- 
tional to the range over which gates can act. Therefore, 
we can also state that any quantum computation in the 
gate array model where either the depth of the compu- 
tation or the range of the interaction scales like log (AT), 
while the other is bounded by a constant, can be effi- 
ciently simulated on a classical computer. 
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